The drift diffusion model as the choice rule in reinforcement learning.

Authors

  • Mads Lund Pedersen
  • Michael J Frank
  • Guido Biele
Abstract

Current reinforcement-learning models often assume simplified decision processes that do not fully reflect the dynamic complexities of choice processes. Conversely, sequential-sampling models of decision making account for both choice accuracy and response time, but assume that decisions are based on static decision values. To combine these two computational models of decision making and learning, we implemented reinforcement-learning models in which the drift diffusion model describes the choice process, thereby capturing both within- and across-trial dynamics. To exemplify the utility of this approach, we quantitatively fit data from a common reinforcement-learning paradigm using hierarchical Bayesian parameter estimation, and compared model variants to determine whether they could capture the effects of stimulant medication in adult patients with attention-deficit hyperactivity disorder (ADHD). The model with the best relative fit provided a good description of the learning process, choices, and response times. A parameter recovery experiment showed that the hierarchical Bayesian modeling approach enabled accurate estimation of the model parameters. The model approach described here, using simultaneous estimation of reinforcement-learning and drift diffusion model parameters, shows promise for revealing new insights into the cognitive and neural mechanisms of learning and decision making, as well as the alteration of such processes in clinical groups.
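The core idea of the abstract, a drift diffusion process acting as the choice rule on top of learned values, can be illustrated with a minimal simulation sketch. It is not the authors' implementation. The abstract does not specify the parameterization, so the delta-rule update, the linear mapping from value difference to drift rate, the starting values of 0.5, and all names (`simulate_rlddm`, `drift_scaling`, `reward_probs`, and so on) are assumptions made for illustration. The sketch only generates data and does not perform the hierarchical Bayesian estimation described above.

```python
import random


def simulate_ddm_trial(drift, threshold, non_decision_time, rng,
                       dt=0.001, noise_sd=1.0, max_time=5.0):
    """Simulate one drift diffusion trial with Euler-Maruyama steps.

    Evidence starts midway between the lower boundary (0) and the upper
    boundary (threshold). Returns (chose_upper, response_time).
    """
    evidence = threshold / 2.0
    elapsed = 0.0
    step_sd = noise_sd * dt ** 0.5
    while 0.0 < evidence < threshold and elapsed < max_time:
        evidence += drift * dt + rng.gauss(0.0, step_sd)
        elapsed += dt
    # If max_time is reached first, the nearer boundary is taken (assumption).
    chose_upper = evidence >= threshold / 2.0
    return chose_upper, non_decision_time + elapsed


def simulate_rlddm(n_trials, reward_probs=(0.8, 0.2), learning_rate=0.1,
                   drift_scaling=3.0, threshold=1.5, non_decision_time=0.3,
                   seed=0):
    """Two-option learning task with a DDM choice rule (illustrative only)."""
    rng = random.Random(seed)
    q = [0.5, 0.5]  # assumed initial expected values for the two options
    trials = []
    for _ in range(n_trials):
        # Across-trial dynamics: the drift rate follows the learned value
        # difference (linear scaling is an assumption).
        drift = drift_scaling * (q[0] - q[1])
        # Within-trial dynamics: noisy evidence accumulation to a boundary.
        chose_upper, rt = simulate_ddm_trial(
            drift, threshold, non_decision_time, rng)
        choice = 0 if chose_upper else 1
        reward = 1.0 if rng.random() < reward_probs[choice] else 0.0
        # Delta-rule update of the chosen option's expected value.
        q[choice] += learning_rate * (reward - q[choice])
        trials.append((choice, reward, rt))
    return trials, q


if __name__ == "__main__":
    trials, q = simulate_rlddm(100)
    mean_rt = sum(rt for _, _, rt in trials) / len(trials)
    print("final values:", q, "mean RT:", round(mean_rt, 3))
```

In this sketch a larger value difference produces a larger drift rate, so choices of the better option should tend to become both more frequent and faster as learning proceeds, which is the joint pattern of choices and response times that such a combined model is meant to capture.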

Similar resources

Neural mechanism for stochastic behaviour during a competitive game

Previous studies have shown that non-human primates can generate highly stochastic choice behaviour, especially when this is required during a competitive interaction with another agent. To understand the neural mechanism of such dynamic choice behaviour, we propose a biologically plausible model of decision making endowed with synaptic plasticity that follows a reward-dependent stochastic Hebb...

Medial prefrontal cortex as an action-outcome predictor

In Simulation 6 in the main text, we simulated the PRO model in a 2-arm bandit task similar to a previously reported study. A reinforcement learning model was then fit to the trial-by-trial choice behavior of the PRO model in order to recover effective learning rates in stable and volatile periods. The reinforcement learning model is described by a learning law that tracks the value (V) of choi...

fMRI and EEG predictors of dynamic decision parameters during human reinforcement learning.

What are the neural dynamics of choice processes during reinforcement learning? Two largely separate literatures have examined dynamics of reinforcement learning (RL) as a function of experience but assuming a static choice process, or conversely, the dynamics of choice processes in decision making but based on static decision values. Here we show that human choice processes during RL are well ...

Connecting rule-abstraction and model-based choice across disparate learning tasks

Recent research has identified key differences in the way individuals make decisions in predictive learning tasks, including the use of feature- and rule-based strategies in causal learning and model-based versus model-free choices in reinforcement learning. These results suggest that people rely to varying degrees on separable psychological processes. However, the relationship between these type...

Threshold Learning for Optimal Decision Making

Decision making under uncertainty is commonly modelled as a process of competitive stochastic evidence accumulation to threshold (the drift-diffusion model). However, it is unknown how animals learn these decision thresholds. We examine threshold learning by constructing a reward function that averages over many trials to Wald’s cost function that defines decision optimality. These rewards are ...

Journal:
  • Psychonomic bulletin & review

Volume 24, Issue 4

Pages: -

Publication date: 2017